Introducing Google's Gemma 3 QAT: A breakthrough in quantization-aware training that enables edge AI deployment with FP16-level performance at quarter memory usage.
Meta releases Llama 3.3: 70B parameter model matches 405B performance, with 128K context window and 8-language support. A breakthrough in open-source AI.